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1) Title: METHOD OF TRANSMITTING AND RECEIVING COMPRESSED TELEVISION SIGNALS 




(57) Abstract 

The invention relates to the transmission of MPEG encoded television signals from a Video-On-Demand server (1) to a receiver 
(2) via a network (3). Non-linear playback functions such as "pause" and "resume" require a very accurate control of the bit strearA, 
taking account of typical network aspects such as network latency and remultiplexing. In order to allow the receiver to flawlessly resun >• 
signal reproduction after a pause, position labels (p; 5,6; 7a, 7b, 8a, 8b) are inserted into the bit stream at positions where the server ci 
resume transmission of the signal after an interruption. Upon a pause request, the decoder initially continues the reproduction until such 
position label is detected. The subsequent bits deliverd by the network are ignored, i.e. they are thrown away. Upon a request to resum 
reproduction, the receiver requests the server to retransmit the signal starting at the detected position. 
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Methods of transmitting and receiving compressed television signals. 



FIELD OF THE INVENTION 

The invention relates to methods of transmitting and receiving compressed 
television signals. The invention also relates to arrangements for carrying out said methods, 
more particularly, arrangements such as video-on-demand servers for transmitting, and set 
5 top boxes for receiving, MPEG compressed audio and video signals. 

BACKGROUND OF THE INVENTION 

The so-called "Transport stream" and "Program Stream" as defined within 
the MPEG-2 standard provide a standardized mechanism for the transmission and storage of 

10 MPEG compressed audio and video (A/V) information. Both streams contain time stamps to 
synchronize the decoding and presentation of the conveyed audio and video information while 
ensuring that data buffers in the decoders do not overflow or underflow during linear 
playback. In addition, these time stamps are used for the correct delivery of the streams 
themselves. The MPEG Transport Stream is tailored to the communication or the storage of 

15 one or more programs of digital data in environments in which significant errors may occur. 
Such environments are for example terrestrial and satellite broadcast channels, cable 
channels, and other telecommunication networks. The MPEG Program Stream is tailored to 
communication or storage of one program of digital data in environments where errors are 
very unlikely, and where software processing of system coding is a major consideration. 

20 Such environments are, for example, PCs with optical and magnetic storage media. 

In the last few years, significant effort has been devoted to applications 
that involve the linear playback and transmission of Program and Transport Streams. For 
example, broadcasting MPEG-2 encoded programs involves the distribution of Transport 
Streams of infinite length. However, the non-linear playback of MPEG Transport Streams 

25 and Program Streams from video servers has not received the same level of attention. Non- 
linear playback involves the interruption and continuation of the stream and is necessary for 
basically all kinds of trick modes. Trick-modes require an accurate control of the stream. For 
local disc-based systems, solutions are known to support trick-modes such as pause, slow 
motion, and fast forward. However, the known solutions can not be applied to network 

30 environments, because networks differ from local systems in two aspects. 
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The first aspect is the delay decoder-network-server-network-decoder. 
This delay, which is further referred to as the network latency, is not constant and can have 
a magnitude of up to one second. A consequence of the network latency is that after the end- 
user requests a pause (resulting in a stall of the decoding and display process and sending a 
5 pause command to the server), the network will continue to deliver bits to the decoder for 
some period of time. The subsequent continuation of the stream is also complicated by the 
network latency. Namely, because of the network latency there is a significant delay between 
the transmission of the 'Resume' command to the server and the arrival of the requested bits 
at the decoder. The bits delivered after the pause command and the temporary lack of bits 

10 after the resume command have to be handled by the decoder in such a way that the decoder 
can flawlessly resume its decoding and display process. 

The second aspect is the possible presence of MPEG-2 remultiplexers in 
the network. The presence of remultiplexers complicates the support for trick-modes because 
remultiplexers allow the packets having different PIDs in a Transport Stream to be 

15 reordered. A PID is a Packet IDentifier as defined in the MPEG-2 standard. In practice this 
means that the packets do not enter the decoder in the same order as they were sent by the 
server. Note that the presence of remultiplexers does not complicate the trick-mode support 
for Transport Streams with only one PID, e.g. video, because remultiplexers are not allowed 
to reorder the packets within one PID. 

20 

OBJECT AND SUMMARY OF THE INVENTION 

It is an object of this invention to provide trick-mode support for MPEG-2 
Transport Streams and Program Streams in network environments. 

To that end, the invention provides a method of transmitting a compressed 
25 television signal, characterized by inserting position labels into said signal at positions where 
the transmission of said signal can be resumed after an interruption. 

The invention is based on the strategy that a decoder throws away all bits 
that are delivered by the network after the 'Pause' command has been issued. For a 
successful continuation of the stream, the decoder has to know exactly what the position in 
30 the stream was (e.g. how many bits have been received from the server) when the pause was 
effected. This is necessary to instruct the server exactly at which point to continue in order 
to keep the MPEG-2 data consistent. Otherwise part of an MPEG frame would be duplicated 
or missing. Normally, the server cannot exactly determine the stream position at the time the 
pause was effected at the decoder due to the latency in the communication between decoder 
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and server. By transmitting unique position labels in each of the PIDs of the Transport 
Stream, decoders can precisely determine which bits are stored and which are thrown away. 
The continuation of the stream involves the request for new bits starting at the appropriate 
position label. 

5 The invention concentrates on two trick-modes, namely: 'pause/continue' 

and 'slow motion'. The required functionality for 'pause and continue' is that a user can 
pause the display at any point in time and can continue flawlessly afterward. Preferably, 
interruption takes effect immediately e.g. at the frame/field which is currently displayed and 
continuation should occur at the next frame without any missing or duplicated frames. 'Pause 

10 and continue' involves the transmission of 'Pause' and 'Resume* commands from the decoder 
to the server to stop and continue the stream. The required functionality for 'slow motion' is 
that a user can visualize the stream at a speed lower than the normal playback speed. The 
decoder should be able to switch between the slow motion mode and normal playback mode. 
The switch should take effect immediately without any missing or duplicated frames. 'Slow 

15 motion' also involves the transmission of 'Pause' and 'Resume' messages from the decoder 
to the server to stop and start the stream. 

Several fields in the MPEG-2 standard can be used to convey the unique 
position labels. The position labels can be accommodated in a separate stream of intermediate 
control packets, for example, using the concept of MPEG stream descriptors. The PCR fields 

20 of each PID stream in the Transport Stream can be used when the PCR information is 
transmitted in a separate stream. PCR means Program Clock Reference as defined in the 
MPEG-2 standard. Accommodating the position labels in a separate stream of control packets 
works well if there are no remultiplexers in the network, i.e. if packets having different PIDs 
cannot be reordered. 

25 The position labels can also be accommodated in the signal (audio, video, 

etc.) packets themselves, for example, in the private data part of adaptation fields of 
particular packets. This embodiment also works for networks comprising remultiplexers. For 
all cases, the repetition rate of the position labels should be high enough to allow for a fine 
control of the stream. The position labels of different PIDs that belong together can be 

30 identified by a particular time code. 

The corresponding method of receiving a digital television signal is 
characterized by the steps of: sending a request to the transmitter to interrupt transmission of 
the signal; interrupting the reproduction of the signal upon detection of a position label 
accommodated in the received bit stream and throwing away the received signal following 
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said position label; sending a request to the transmitter to resume transmission of the signal 
from the position defined by said position label; and resuming the reproduction of the signal 
upon receipt thereof. 

The continuation of the stream involves a delay equal to the network 
5 latency. The delay can be circumvented by using a latency buffer whose size is equal to the 
maximum network latency times the bit rate of the stream. This approach works for both 
'Pause and Continue' trick modes as well as for the 'Slow Motion' trick mode. With such a 
latency buffer, the decoder may pause the reproduction of the signal immediately and keep 
the bits up to a later received position label available in the buffer. Only the bits following 
10 said later received position label are now thrown away. Upon the resume command, the 
reproduction of the still available bits is resumed and the server is requested to resume 
transmission of the bits following the later received position label. 

These and other aspects of the invention will be apparent from and 
elucidated with reference to the embodiments described hereinafter. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 shows a system comprising a Video-On-Demand server and a set 
top box in accordance with the invention. 

Fig.2 shows an example of a television signal transmitted in accordance 
20 with one embodiment of the invention. 

Fig.3 shows a flowchart of operation steps carried out by the server and 

the set top box. 

Fig.4 shows a timing diagram illustrating the operation of the server and 

the set top box. 

25 Fig.5 shows an example of how the television signal shown in Fig.2 may 

be received by the set top box. 

Fig. 6 shows an example of a television signal transmitted in accordance 
with a further embodiment of the invention. 

Fig. 7 shows an example of how the television signal shown in Fig.6 may 
30 be received by the set top box. 

DESCRIPTION OF PREFERRED EMBODIMENTS 

Fig.l shows a system comprising a Video-On-Demand (VOD) server 1 
and a set top box (STB) 2 connected via a network 3. The VOD-server comprises a storage 
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medium 11 and a VOD-controller 12. The STB comprises an MPEG decoder 21, an STB- 
controller 22 and a (remote) control unit 23, and is connected to a reproduction device 4. 
User commands from remote control unit 23 are applied to STB-controller 22 and, as far as 
they require actions to be taken by the server, transmitted to the VOD-controller 12 via the 
5 network 3 as a control signal CTRL. A selected television program is transmitted from the 
server 1 to STB 2 in the form of an MPEG Transport Stream TS. 

A plurality of television programs is stored on storage medium 11 which 
is usually an airay of hard disks. From the STB point of view, the server behaves as a 
remote video recorder. The stored television programs can be played back at various speeds, 

10 paused and resumed. More particularly, the server can be instructed to playback the program 
starting at a given "tape" position. The operational command therefor will hereinafter be 
referred to as "Resume(p)" in which p identifies the position at which transmission of the 
signal can be resumed. The number of positions where playback of the program can be 
resumed is restricted, depending on how the signal is physically stored. For example, the 

15 positions have to correspond with disk sector or disk cylinder boundaries. Because the signal 
is compressed, i.e. the amount of disk space varies from picture to picture, resume positions 
are irregularly distributed over playback time. They may be several seconds apart. 

The resume positions are known to the VOD-controller 12. As is shown 
in Fig.l, the server 1 further comprises a packetizer circuit 13. This circuit receives an 

20 MPEG-compressed television signal from storage medium 11 and generates a stream of 
transport packets of Fixed length. The packetizer also receives the resume positions p from 
VOD-controller 12 and inserts them in the transport stream in a manner which will be 
described later. The positions p can be expressed in various ways. For example, p may be an 
ordinary integer number. In the present example, p will be assumed to be a time code 

25 corresponding with the normal playback time. 

Generally, television programs comprise a video signal, several audio 
signals and additional data for, inter alia, subtitles in various languages. For simplicity, it 
will be assumed here that the television program which is transmitted from the server to the 
STB comprises one audio signal A and one video signal V only. Accordingly, the transport 

30 stream comprises audio packets and video packets. Each packet comprises a header and a 
payload. The header comprises a Packet IDentifier (PID) which identifies whether the packet 
carries audio data or video data. Note that the Transport Stream TS may comprise various 
television programs. A Program Map Table is transmitted to indicate which PIDs constitute 
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the relevant program. In the present example, the Transport Stream TS conveys the selected 
television program only. 

Fig.2 shows an example of the Transport Stream TS which is transmitted 
by the server to the STB in accordance with an embodiment of the invention. The Transport 
Stream includes audio packets A t , A 2 , A 3 , etc. and video packets V 2l V 3 , etc. At the 
resume positions defined above, the Transport Stream includes position labels in the form of 
control packets C(p), two of which are shown by reference numerals 5 and 6. The PID of 
said control packets is associated with the relevant television program by means of the 
Program Map Table described above (control packets associated with a different television 
program have a different PID). In the pay load of the packets, the relevant resume position p 
is encoded in the form of a time code. It should be noted that Fig.2 is not drawn to scale. In 
practice, more audio and video packets are transmitted between two successive position 
labels than are shown in the Figure. 

The operation of the server and the STB will now be explained 
collectively with reference to Fig.3, which shows a flowchart of operation steps carried out 
by the STB (left side of the Figure) and the server (right side of the Figure). In a step 31 at 
t=t lt the STB receives a pause command. However, the STB continues to decode and 
reproduce the received signal until a position label p is detected. Then, the reproduction of 
the signal is stalled in a step 32, at t=t 2 , i.e. the video display is frozen and the audio 
reproduction is stopped. 

In the step 32 the STB also forwards the pause command to the server. 
Transmitting the pause command to the server upon detection of the position label instead of 
immediately transmitting it guarantees that there is at least one position label present in the 
received signal. The pause command is received by the server in a step 33, at 1=13, with a 
delay r t , due to the latency of the network. In response to the pause command, the server 
instantaneously interrupts the transmission of the signal. 

Step 34 denotes the interruption of the signal reception at the receiver 
end, at t=t 4 , due to the latency t 2 of the network. In a step 35, the STB throws away all the 
received signal data following the position label that has been detected in the step 32. 

In a step 36, at t=t s , the STB receives a command from the user to 
resume the reproduction and transmits a "Resume(p)" command to the server in which p 
denotes the position where the reproduction was stalled. The resume command is received by 
the server in a step 37, at t=t6, with a delay r 3 due to the latency of the network. In 
response to the resume command, the server resumes signal transmission starting at the 



WO 97/04585 ? PCT/IB96/00707 

position defined by p. With a delay t 4 , the signal arrives at the STB which resumes the 
reproduction in a step 38, at t=t 7 . It should be noted that the delays r r T 4 are not necessarily 
equal. They neither need to be constant over time. 

Fig. 4 shows a timing diagram of signals at various points in the system. 
More particularly, TS1 denotes the Transport Stream as transmitted by the server, TS2 
denotes the Transport Stream as received by the STB, and DSP denotes the reproduced 
signal. At t=t„ the pause command is received. At t=ta, a position label 40 is detected. In 
response thereto, the reproduction is stalled and the packets 41 delivered by the network after 
the label are thrown away. At t=^, the server receives the pause command and interrupts the 
signal transmission which becomes manifest at the receiver end at t=t,. At t=t s , the STB 
receives the resume command. At 1=1$, the signal transmission is resumed starling with the 
packets 41 and, at t=ty, the reproduction is continued flawlessly. 

The MPEG systems specification allows the order of packets having 
different PIDs in a Transport Stream to be changed. This will be the case if the network 
between server and STB comprises remultiplexers. For example, the Transport Stream shown 
in Fig.2 may be received by the STB as a Transport Stream shown in Fig.5. The transmitted 
packet sequence Vj-Cj-Aj-V, is now received as A 3 -V g -V 9 -C 2 which is still a valid MPEG 
sequence. In accordance with the strategy described above, the STB will now stall the 
reproduction after decoding A 3 ,V 9 rather than A 2 ,V S . However, the server will retransmit 
these packets after an interruption. Needless to say that the STB cannot flawlessly resume the 
reproduction of the signal. 

A further embodiment of the invention will now be described in which the 
remultiplexing aspect is taken into account. In this embodiment, the VOD-controller 12 
(Fig.l) is adapted to accommodate the position labels in the audio and video packets 
themselves rather than inserting them in intermediate control packets which have a different 
PID. Preferably, the position labels are accommodated in the adaptation field which the 
MPEG standard provides. An adaptation field is an optional data field following the header 
of a transport packet. Two bits in the header indicate whether or not an adaptation field is 
present and whether or not the adaptation field is followed by payload. The MPEG standard 
allows private data bytes to be accommodated in the adaptation field. Preferably, these 
private data bytes carry the position label information. 

Fig. 6 shows an example of the Transport Stream TS which is transmitted 
by the server in accordance with this embodiment. In this example, position labels are 
accommodated in audio packets 7a and 8a. The same position labels are accommodated in 
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video packets 7b and 8b, respectively. Fig.7 shows how this Transport Stream may be 
received by the STB due to remultiplexing. 

The operation of the STB is essentially the same as described before. 
After a pause command, the STB continues to reproduce the video signal until a packet (for 
5 example, video packet V 5 denoted 7a) comprising a position label p is detected, and 

continues to reproduce the audio signal until an audio packet comprising the same position 
label p (in this case packet A 2 denoted 7b) is detected. Thus, irrespective of whether the STB 
receives the original Transport Stream shown in Fig. 6 or the remultiplexed Transport Stream 
shown in Fig.7, video reproduction is interrupted after decoding V 4 and audio reproduction is 
10 interrupted after decoding A,. Upon resumption of the operation, the STB will send the 

H Resume(p)" command to the server. In response thereto, the server will restart transmission 
of the signal with A 2 and V 5 and the decoder will flawlessly resume the reproduction of both 
signals. 

In summary, the invention relates to the transmission of MPEG encoded 
15 television signals from a Video-On-Demand server (1) to a receiver (2) via a network (3). 
Non-linear playback functions such as 'pause' and 'resume' require a very accurate control 
of the bit stream, taking account of typical network aspects such as network latency and 
remultiplexing. In order to allow the receiver to flawlessly resume signal reproduction after a 
pause, position labels (p; 5,6; 7a,7b,8a,8b) are inserted into the bit stream at positions where 
20 the server can resume transmission of the signal after an interruption. Upon a pause request, 
the decoder initially continues the reproduction until such a position label is detected. The 
subsequent bits delivered by the network are ignored, i.e. they are thrown away. Upon a 
request to resume reproduction, the receiver requests the server to retransmit the signal 
starting at the detected position. 
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1, A method of transmitting a compressed television signal, characterized by 

inserting position labels into said signal at positions where the transmission of said signal can 
be resumed after an interruption. 

5 2. The method as claimed in Claim 1, in which the signal is transmitted as a 

series of packets, characterized in that the position labels are accommodated in intermediate 
control packets. 

3, The method as claimed in Claim 2, characterized in that the position 
10 labels are transmitted as MPEG stream descriptors in said control transport packets. 

4, The method as claimed in Claim 1, in which the signal is transmitted as a 
series of packets, characterized in that the position labels are accommodated in said packets. 

15 5. The method as claimed in Claim 4, in which the position labels are 

transmitted as private data bytes in the adaptation field of MPEG transport packets. 

6. A method of receiving a compressed television signal from a transmitter, 
characterized by the steps of: 

20 sending a request to the transmitter to interrupt transmission of the signal; 

interrupting the reproduction of the signal upon detection of a position label 
inserted into said signal, and throwing away the received signal following said position label; 

sending a request to the transmitter to resume transmission of the signal from 
the position defined by said position label; and 
25 resuming the reproduction of the signal upon receipt thereof. 

7. A transmitter for transmitting a compressed television signal, 
characterized by means for inserting position labels into said signal at positions where the 
transmission of said television signal can be interrupted and resumed. 



WO 97/04585 1Q PCT7IB96/00707 

g. The transmitter as claimed in Claim 7, in which the signal comprises a 

series of sequential data packets, characterized in that the transmitter is adapted to 
accommodate said position labels in intermediate control packets. 

5 9_ The transmitter as claimed in Claim 8, characterized in that the position 

labels are transmitted as descriptors in MPEG transport packets. 

10. The transmitter as claimed in Claim 7, in which the signal comprises a 
series of sequential data packets, characterized in that the transmitter is adapted to 

10 accommodate said position labels in selected ones of said signal packets. 

11. The transmitter as claimed in Claim 10, in which the position labels are 
transmitted as private data bytes in the adaptation field of MPEG transport packets. 

15 12. A receiver for receiving a compressed television signal, characterized in 

that the receiver comprises control means adapted to: 

send a request to a transmitter to interrupt transmission of the signal; 
interrupt the reproduction of the signal upon detection of a position label 
inserted into the signal, and throw away the received signal following said position label; 
20 send a request to the transmitter . to resume transmission of the signal from the 

position defined by said position label; and 

resume the reproduction of the signal upon receipt thereof. 



25 



13. A compressed television signal, characterized by including position labels 

inserted at positions where the transmission of said signal can be resumed after an 
interruption. 
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